Density Estimation under Independent Similarly Distributed Sampling Assumptions

Authors

  • Tony Jebara
  • Yingbo Song
  • Kapil Thadani
Abstract

A method is proposed for semiparametric estimation where parametric and nonparametric criteria are exploited in density estimation and unsupervised learning. This is accomplished by making sampling assumptions on a dataset that smoothly interpolate between the extreme of independently distributed (or id) sample data (as in nonparametric kernel density estimators) and the extreme of independent identically distributed (or iid) sample data. This article makes independent similarly distributed (or isd) sampling assumptions and interpolates between these two extremes using a scalar parameter. The parameter controls a Bhattacharyya affinity penalty between pairs of distributions on samples. Surprisingly, the isd method maintains certain consistency and unimodality properties akin to maximum likelihood estimation. The proposed isd scheme is an alternative for handling nonstationarity in data without making drastic hidden variable assumptions, which often make estimation difficult and laden with local optima. Experiments in density estimation on a variety of datasets confirm the value of isd over iid estimation, id estimation, and mixture modeling.
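To make the role of the scalar parameter concrete, the following is a minimal sketch, not the authors' formulation. It uses the standard closed form of the Bhattacharyya affinity between two univariate Gaussians, and then a hypothetical penalized objective in which each sample has its own mean and a weight `lam` scales the sum of pairwise log-affinities. The function names, the choice of a Gaussian model with a shared `sigma`, and the exact form of the penalty are all assumptions made for illustration; the abstract does not specify them.

```python
import math


def bhattacharyya_affinity(mu1, sigma1, mu2, sigma2):
    """Closed-form affinity (integral of sqrt(p*q)) between two univariate Gaussians."""
    var_sum = sigma1 ** 2 + sigma2 ** 2
    scale = math.sqrt(2.0 * sigma1 * sigma2 / var_sum)
    return scale * math.exp(-((mu1 - mu2) ** 2) / (4.0 * var_sum))


def gaussian_log_pdf(x, mu, sigma):
    """Log-density of N(mu, sigma^2) evaluated at x."""
    return -0.5 * math.log(2.0 * math.pi * sigma ** 2) - (x - mu) ** 2 / (2.0 * sigma ** 2)


def penalized_log_likelihood(xs, mus, sigma, lam):
    """Hypothetical objective (an assumption, not the paper's): per-sample fit
    plus lam times the sum of pairwise log-affinities between sample models."""
    fit = sum(gaussian_log_pdf(x, mu, sigma) for x, mu in zip(xs, mus))
    penalty = sum(
        math.log(bhattacharyya_affinity(mus[i], sigma, mus[j], sigma))
        for i in range(len(mus))
        for j in range(i + 1, len(mus))
    )
    return fit + lam * penalty


if __name__ == "__main__":
    xs = [0.0, 2.0]
    per_sample = xs            # every sample gets its own mean (id-like extreme)
    shared = [1.0, 1.0]        # all samples share one mean (iid-like extreme)
    for lam in (0.0, 10.0):
        print(lam,
              penalized_log_likelihood(xs, per_sample, 1.0, lam),
              penalized_log_likelihood(xs, shared, 1.0, lam))
```

In this toy setting, `lam = 0` favors giving each sample its own mean, while a large `lam` favors a single shared mean, which mirrors the id-to-iid interpolation the abstract describes.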


Similar resources

Interval estimation of the process capability index

Several sampling distribution properties of the estimator for Cpk are presented under the assumption that the data are normal, independent and identically distributed. In this paper, using these assumptions, the expectation, variance and skewness are calculated by statistical methods. Since the sampling distribution is weakly skewed, it is concluded that a symmetric interval estimator for Cpk mi...


Nonparametric adaptive estimation for pure jump Lévy processes

This paper is concerned with nonparametric estimation of the Lévy density of a pure jump Lévy process. The sample path is observed at n discrete instants with fixed sampling interval. We construct a collection of estimators obtained by deconvolution methods and deduced from appropriate estimators of the characteristic function and its first derivative. We obtain a bound for the L-risk, under ge...


On Rates of Convergence for Stochastic Optimization Problems Under Non-Independent and Identically Distributed Sampling

In this paper we discuss the issue of solving stochastic optimization problems by means of sample average approximations. Our focus is on rates of convergence of estimators of optimal solutions and optimal values with respect to the sample size. This is a well-studied problem in case the samples are independent and identically distributed (i.e., when standard Monte Carlo simulation is used); he...


Piecewise linear density estimation for sampled data

Abstract – Nonparametric density estimation is considered for a discretely observed stationary continuous-time process. For each of three given time sampling procedures either random or deterministic, we establish that histograms and frequency polygons can reach the same optimal L2-rates as in the independent and identically distributed case. Moreover, thanks to a suitable “high frequency” samp...


Regression calibration in semiparametric accelerated failure time models.

In large cohort studies, it often happens that some covariates are expensive to measure and hence only measured on a validation set. On the other hand, relatively cheap but error-prone measurements of the covariates are available for all subjects. Regression calibration (RC) estimation method (Prentice, 1982, Biometrika 69, 331-342) is a popular method for analyzing such data and has been appli...




Publication date: 2007